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ABSTRACT 



The control unit arbitrarily selects a disk unit among the disk 
units that are inactive when the control unit receives an 
input/output request involving either read or staging. For a 
write request, the control unit selects a master disk unit in the 
disk unit group for the immediate writing of data, and a disk 
unit other than the above-mentioned master disk is prefer- 
ably selected to execute the read and staging unit After 
writing is used to write the data into the other disk units after 
the write request execution is completed by writing to the 
master disk unit 

47 Claims, 15 Drawing Sheets 
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DATA DISTRIBUTION UTILIZING A live disk unit by the control unit for an input/output request 

MASTER DISK UNIT FOR FETCHING AND from the CPU. However, this is applied to the doublewrite 

FOR WRITING TO REMAINING DISK function by utilizing a cache without a write after control, 

UNITS and therefore it's reliability is lowered. This is due to the 
. 5 high possibility that write data is received from the CPU that 
This is a continuation application of Set No. 07/643,998, is applicable to all the disk units, but is stored in the cache 

filed Jan. 31, 1991 now abandoned. wimout betog immediately written to a disk unit Therefore, 

t» x ™r ~. „ if power failure occurs in the cache in combination with the 

BACKGROUND OF THE WVEtmON breakdown of any one of the disk units, te^TdS 

The present invention relates to the control of the execu- 10 receivcd from ^ is lost 
tion of a load for a storage, particularly the control of parallel Furthermore, in the case of a control unit having a cache, 
processing with respect to input and output for disk drives. mc contro1 unit can execute an input/output between the 
In Japanese Patent Laid-Open No. 114947/1985 a double Cachc unit ^^ndently of an input/output 

write control has a cache (hereinafter referred to simply as ,« ?f CSt **** C ? U \ as disclosed ^ thesis in the 
a cache). Two disks, called double write disks arTeach formation Process Institute Bulletin, mentioned above. In 
written with the same data. A control unit processes an input ™ W ^ thc n mventors ^nk that attention should be 
output request from a CPU for one of the two disk units In & ™ ? ? C P 055 * 51 ^ a P luraHt y of ^ units can be 
mccaseofrecdvmgareadrequest(mput rc quest)fxomm * A , an input/output process by the control unit 

CPU, the control unit executes the request as it is. In the case 20 mdc P CIldcntl y of an "5»t/cotput request from the CPU. 
of receiving a write request (output request) from the CPU, Japanese Patent Laid-Open No. 135563/1984 does not 
data is written in a specific one of the double write disks and have any rclation t0 toe double write system. This patent 
at the same time the same data is written in the cache. At a relates to toe cache disk control unit with a write after 
later time, making use of available processing time when the contr °L The disk control unit stores the write data received 
control unit and disks have nothing else to do, the control 2 s ^° mtht CPU 10 botn me cacne memory and the non-volatile 
unit writes the same data from the cache into the other disk memoi y- The write data in the cache memory is written to 
unit, which is called a write after process. In this manner, the me . ^ unit h ? utilizing a write after process. Therefore, the 
same data is written to each disk unit of the double write disk writc rc ^ ucst ^ucd by the CPU can be processed at high 
units. speed without accessing the disk unit, moreover, this can 

In Japanese Patent Publication No. 28128/1986, there is 30 ^f? * C Process. If the write 

disclosed a double filing control for load distribution with TS™ ^V* 4 ^ raem °7 15 lost because <* ^ breakdown 
respect to double write disk units. There is no write after memory, ^ ^ te *** remains * the non- 

process. The control is designed to achieve a higher pro- nie ? 0 ? However > P** does *<* relate to the 

cessing speed by selecting an inactive disk unit, among the S^Sf* f^Tt Laid 0pen Na 

disk units, when an input/output request is receive/ An 35 t0 write system deluding the 

inactive disk unit will be defined herein as a disk unit that is "f 1 ^ ^J.T 1 w,tt l a controL ^is 

currently not undergoing any disk accessing, that is not te **** ^ k °° ntro1 *** ™ P M0TmA ' 

undergoing any read or write operation. *° n to ^mze the area in the disk unit where the write 

Tn * th^cic f mn A tr\ tt% t f ^_ , . data in the cashe has not been written yet The bit map 

* ^ J u- ^ , ^ omiat ">a ^occss Institute information is utilized to reduce the recovery time when the 
f££2^ 40 ca ^^r,eaksdown.However,thispS 

IS^^^^^S^a 9 Z ' St' % 10 ^ l0ad among the disk units in the disk unit 

313-320(1984), there is disclosed with respect to a single group. 

disk unit a read ahead control having a cache, which c™;««n„ *. 

involves the staging, in the cache, data notrequested by the r^^^T^^^^^^t * ^ 0001101 

CPU but which will be requested in an instruction shortly « ^^^ H ^^ su ^^ m ^'^ 

following the current inaction. TTus staging process I ^7JlTT^ ^ T COn * risiD S one or 

ewcutodlylheccmtrdu^^ ^ T > «mt gn^p comprises one disk 

of an input/output request fromthe CPU ?* ^ has a P laraU * of 0D cach <* w ^ 

is written the same data. If the disk unit group comprises a 

SUMMARY so P}™^^ of units, each disk unit may have one or more 

disks, with the same data being written to each disk unit of 

lit is an object of the present invention to solve problems, die group, 

analyzed below, that the inventors have found with respect The object of the present invention is to provide control 

to the above-noted controls. for improving parallel execution of input/output processes 

Japanese Patent Laid-Open No. 114947/1985 does not 55 bv distributing the processes among disk units in the disk 

give any attention to a potential advantage of the double umt group, for distributing the load of the input/output 

write disk system, namely that a plurality of disk units can processes under the control of the control unit 

be controlled by the control unit, but instead the document To better understand the present invention, input/output 

discloses that the CPU input/output request is limited to one processes, which the control unit executes between the 

specific disk unit as requested by the CPU. Therefore, even eo control unit and the disk units can be classified into four 

though there is another disk unit that may be inactive, the kinds, as follows: 

request cannot be fulfilled by the control unit if the CPU (1)A write request received from the CPU, whichrequires 

requests the one specific disk unit that happens to be active access to a disk unit 

L^^^^^'T- ^ ^ WhCn lt ^ <2>Areadrequestreceivedfromto 

undergoing some type of input/output process. & access to a disk unit 

nnvwfi!** h ^ / a ^ CSe Pat t nt P ^ bli . Cation No - < 3 > A process performed independently of an 

28128/1986 has excellent performance by selecting an mac- input/output request from the CPU (that is indepen- 
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dently of a read request or a write request from the mentioned documents. As compared with the control dis- 

CPU), which transfers the data from a disk unit to a closed in Japanese Patent Laid-Open No, 114947/1985, the 

cache. first type of distribution according to the present invention is 

(4) A write after process executed between the control unit slightly inferior in the distribution effect far the write 

and a disk unit 5 request, but as compared with the control disclosed to 

Of the above mentioned four kinds, the write after process Japanese Patent Publication No. 28128/1986, the present 

is not an object for load distribution, as will be explained invention provides superior and excellent performance. The 

later. The write after process is executed, with respect to a first type of load distribution according to the present 

disk unit group, for all of the disk units other than those to invention has a restriction with respect to the free selection 

which the same data has already been written, when a write 10 of the disk unit for a write request. Accordingly, the distri- 

rcqucst received from the CPU is executed. Therefore, there buu'on effect is lower as compared with the control of 

is do freedom for selecting a disk unit which should be used Japanese Patent Laid-Open No. 1 14947/1985 that can select 

to execute the write after process. Therefore, in the above any disk unit within the disk unit group. However, far a read 

four processes, the first three processes are objects for load request, any inactive disk unit is selected by the present 

distribution. 15 invention. Usually, there is a tar greater number of read 

In the present specification, two kinds of load distribution requests than the number of write requests, for disk units in 

according to the present invention will be discussed. general, and the ratio is approximately 3: 1 to 4: 1. Therefore, 

In the first kind of load distribution, the control unit the first load distribution type shows not so large a degra- 

selects a disk unit among the disk units that are inactive dation in the performance as compared with the control 

when the control unit executes an input/output process 20 disclosed in Japanese Patent Laid-Open NO. 114947/1985. 

involving either the second (read) or third (staging) kind of On the other hand, as compared with the control disclosed 

process. When a disk unit should be selected for a write in Japanese Patent Publication No. 28128/1986, which uses 

request from the CPU, which requires access to the disk unit one disk unit intensively for all input/output requests, the 

accordUng to the first type of the four mentioned input/output first type of load distribution according to the present 

processes, the control unit selects a specific disk unit in the 25 invention shows a far better performance, 

disk unit group for the immediate writing of data. The reliability of the first type of load distribution accordV 

In the second kind of load distribution, when the control ing to the present invention is higher than the reliability 

unit selects a disk unit for an input/output process of the first provided by the disclosure of Japanese Patent Laid-Open 

type, that is for the write request received from the CPU No. 114947/1985, and is almost equal to that of the method 

which requires access to a disk unit, a specific disk unit in 30 disclosed in Japanese Patent Publication 28128/1986. For 

the disk unit group is selected. When a disk unit is selected the first kind of load distribution according to the present 

to execute an input/output process of the second and third invention or the Japanese Patent Publication No. 28128/ 

types (read and staging), a disk unit other than the above- 1986, there is no data for the write after process for the disk 

mentioned specific disk unit is selected, preferably arbi- unit for which write requests are intensively assigned. The 

trarily. 35 write after process does not write data to any specific disk 

The functions of the first kind of load distribution will be unit for which write requests are intensively assigned, 

discussed. When a control unit receives from the CPU a read Therefore, even if there is a power failure in the cache, no 

request which requires access to a disk unit, the control unit write data received from the CPU is lost unless the specific, 

executes the following process. For the read request, the master disk unit intensively storing all of the write requests 

control unit selects arbitrarily (that is independently of the 40 is also damaged. If, according to Japanese Patent Publication 

CPU, which includes according to an algorithm imple- No. 28128/1986 me write request was immediately executed 

mented in the control unit), a disk unit among the inactive for a random one of the disk units and the write data was 

disk units in the disk unit group (each of the disk units in the saved in the cache for a later write after. Therefore, if the 

disk unit group has on it the same data to be read). If no cache lost its data before the write after could be completed 

inactive disk unit is found among the disk units of the disk 45 and any one of the disks in the disk unit group is damaged, 

unit group, the control unit will place the read request in a the data could be completely lost Whereas in the present 

wait state. In the case of receiving a write request from the invention, the write request is always immediately executed 

CPU requiring access to a disk unit, the control unit selects with respect to one specific disk, a master disk, so that even 

one specific disk unit, hereinafter called the master disk unit, if the data is lost in the cache before the write after process 

among all the disk units of the disk unit group. If the specific 50 can be completed, the data can be read from the master disk 

disk unit, particularly the master disk unit, is active with reliably. Accordingly, the load distribution of the first type 

respect to some other input/output process, the control unit according to the present invention has high performance and 

will place the write request in a wait state. In the case of high reliability with respect to a disk unit group, in a well 

executing a staging performed by the control unit indepen- balanced manner. 

dently of an input/output request from the CPU, an inactive 55 The function of the second type of load distribution, 

disk unit among the disk units of the disk unit group is according to the present invention, will be discussed 

selected for the staging, that is for transfer of information When the control unit receives from the CPU a write 

between the disk unit and the cache. If all of the disk units request requiring access to a disk unit in the disk unit group, 

subject to such a selection are active with some other the control unit selects one specific disk unit, hereinafter 

input/output process, the control unit places the staging in a 60 called the master disk unit, among all the disks units of the 

wait state. disk unit group for immediate execution of the write request, 

In general, an input/output process placed in a wait state and also writes the same data to the cache for later execution 

will be periodically reviewed to see if it can be executed, and of the write after process. However, if this specific disk unit, 

if it can be executed, it will be executed. the master disk unit, is in an active state, the control unit 

The first type of a load distribution according to the 65 places the write request in a wait state. When receiving a 

present invention has improved reliability and improved read request from the CPU requiring access to a disk unit in 

features, with respect to the control disclosed in the above- a certain disk unit group, the control unit executes the 
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t^^^F^fl^ ( aAitau y with respect FIG. 7 is a flowchart showing independent stagine with 

to the CPU and selectable according to random distribution respect to the second kind of load dictation STto 

or some algorithm by the control unit) disk unit in an the present invention; «™on »«»*«« »° 

inactive state is selected from among the disk units of (he S~ ° -,. ^ 

disk unit group other than the above-mentioned specific disk 5 illustrates the structure of a disk unit; 

unit, that is other than the master disk unit. That is, the read 9 illustrates the structure of a track; 

request is performed with respect to any of the disk units of PIG. 10 illustrates the structure of a cache- 

the disk unit group except for the master disk unit If no vrn « -u . , ' 

inactive disk unitis found among the disk units ofcer to ^ D0CCSMry Prided in a 
this master disk unit, the master disk unit is then examined 

to determine whether or not it is inactive. If the master disk 10 shows the segment management inf ormation for 

unit is inactive, as determined by such examination, the me P*™** 1 invention; 

control unit selects the master disk unit to complete the'read FIGS. 13a and ttb show the storage format for a record 

request, and if the exarnination reveals that the master disk on a track in a segment unit* 

When attempting to execute a stage process indepen- . , Ub iUl " tiates ^ structure of an empty segment que 

dently of an input/output request from the CPU, the control ncad P°"*er, 

unit performs the following process, for the second load 15 snow s the information stored in a control infor- 

distribution kind in the present invention. First one arbitrary mation memory; 

20 m 16 mustratcs ta structurc of a disk unit group 
unit group other than the master disk unit If do inactive disk information* 

unit is found for such selection, the master disk unit is mo i<rm ^ . - 

examined to deterrnine whether or not it is inactive. If this illustrates the structure of a disk unit information; 

determination finds the master disk unit inactive, the control ^ illustrates the module of a director; 

unit selects the master disk unit for execution of the staging, „ FIG. 19 is a flowchart for input/output request reception- 
and if the examination finds that the master disk unit is rnn ->n , a™ t, ^ r ~ * 

active, the control unit places the staging wak sTte ™* * ? * fl ° Wchart ^ * ^ ^ P r ^^ing; 

The reason why the second load distribution kind accord- flowchart for an independent staging; 

ing to the present invention is more desirable than the first FIG * 22 " a flowchart for a disk unit read write process; 
load distribution kind is as follows. As an example, let it be 30 

assumed that a read request is assigned to a specific disk unit FIG. 23 is a flowchart for an inputtoutput request recep- 
for which write requests from the CPU are intensively according to the second kind of load distribution of the 

assigned, mare specifically, the master disk unit, by the first present invention, 
load distribution kind. If a write request is received before 

the process for the read request is completed, the control unit DETAILED DESCRIPTION 

cannot start executing the write request. Therefore, the disk 35 All of the following figures and description acolv eauaUv 
units ofcer than the master disk unit should preferably be to the first type of load istribuUon 2 SE !£2 
mTcPU ^^^^dd'^U^ WdtC r 1 " 0 ? fr ° m Mention and me second 

Ae CPU. Thus the load distribution effect can be enhanced to the present invention, except where the differences^! 
£ the second type of load distribution of the present „ specifically disclosed witn respect to the s^on7t^ 0 no!d 
myenton as compared^ « distribution that is a modification of iL $KE2 

and as compared to the load distribution of the above- distribution 
mentioned documents. _^ . . ' t 

FIO. 2 is a block diagram showing the configuration of a 
BRIEF DESCRIPTION OF THE DRAWINGS computing system of the present invention. The computing 

Further, objects, features and advantages of the present 45 s y stOT con ¥™e^ aplurality of processors 210, each having 
invention will become more clear from the following 3 CPU 200, a main memory (MM) 201 and channels 202; a 
detailed description of a preferred embodiment of the ? ontro1 unit 2031 a P 1 ™"^ of disk units 204 grouped 
present invention, with reference to the accompanying lcsscr P 1 " 1 * 1 ^ °f <H*k unit groups 211. In this respect, 

drawing, wherein: become clear from the following description that the 

FIG. 1 illustrates the basic operation of a control unit with 30 2™*°* Mention is applicable to the control unit 203 
respect to a first kind of load distribution according to the f 01 ^^ t0 a s^glc or a plurality of processors 210, as 
present invention; indicated. There are a plurality n of disk units 204 grouped 

FIG. 2 is a block diagram showing the configuration of a 2£ ^117^ m ° f ^fl™* 2U ' * 
computing system of the present invention* each of the m disk unit groups 211 has more than n disk units 

FIG. 3 shows parallel processing for 'an input/output 35 22l£S^ 
process received from a ^ocessor and an inpS o^k ^ hZLc ^ ^ ^ 1* ^ V °° pi ' 

mput output requests received respectfully from a plurality description 
of processors; _ 

FIG. 5 shows parallel processing between a plurality of 20?! ^Xf^ ™ ° DC director 

input output requests received froi a Zfr 'i^S? Dl5.M^ H ! VOkti,e as a 

T7TO /,i _\ t , . piwcaaui, ^RAM) 206, control informaUon memoir 207 and directnri; 

nG.Silh^atesmebaskparaUdoperationofthea.ntrol 65 208. Each of the directors 205 selec^io^ 

umt operatmg wtfl, respect to the second kind of load between a channel 202 and a disk SaJ^Sw a 

dtstabutron, accordmg to toe present invention; channel 202 and the cache 206, and betweenfte S 



06/02/2003, EAST Version: 1.03.0002 



5,680,574 

7 8 

and a disk unit 204. The cache 206 stages data that requires In FIG. 1, the disk unit group 211A, the disk unit group 

more frequent access, which data is also stored along with 211B and the disk unit group 211C are connected to the 

other less frequently accessed data in the disk units 204. The control unit 203. The disk unit group 211A comprises a 

directory 208 stores information needed to manage the cache master disk unit Ao and a plurality of disk units Al through 

2#<J. The staging is executed by the directors 205. A specific 5 aj« which can vary in number from one to many disk units 

example of staging data can be the object data accessible ana * wm ch are disk units other than the specific unit, that is 

from the CPU 20, and the stared data in the vicinity of this other me master disk unit Ao. Likewise, the disk unit 

data in the disk unit 204. group B comprises a master disk unit Bo, and a plurality of 

The control unit 203, to which the present invention is omer un jts Bi through Bj, and the disk unit group C 

specifically directed, has the function to write the same data 1Q cora pri ses a master disk unit Co and other disk units Ci 

to all of the disk units 204 belonging to a certain disk unit through q. 

group 211, that is, the so-called multiple writing function. The ^^0^ requests received by the control unit203 

Therefore, it can be considered that the processor 210 issues from ^ ssor 210 mat ^ access t0 a ^ uni t 211 

an inputfoutput request selectively to each of the disk groups wm ^ ^ saibed separatdv for mc write and read requests. 

211 - . „ . . 4 . . 15 In FIG. 1, data flow 110 is for a write request, and data flow 

From the viewpoint of the control unit 203, acceptable a ^ t Thc naft 203 ft 

mpuyoutput processes from the processor 210 can be clas- ^ m ^ ^ pmccsSQI 210 whidj requires 

sifted as follows, access to the disk unit group 211A The control unit 203 

(1) An input/output request E^.^'^^S™ selects the master disk Ao within the disk unit group 211A 
access to a disk unit 214, which is a request for data °\. r A . 
*a^fer between the cache 206 and the processor 210. » with jelechon (a In other wad* the master disk AO is 
For example, the process executed when the data for a regarded as the disk unit in which the write quests which 
read request received from the processor 210 has require an access to the disk unit group 211A are concen- 
already been staged in the cache 206 as shown by the trated. According to the more limited aspect of the present 
cache directory 208. invention, all of the write requests immediately transfer 

(2) A process involving input/output request from a CPU 25 write data to the masterdiskAo and the cache 206. The same 
requiring access to a disk unit 204 in a specified disk write data is later transferred by the cache to all of the other 
unit group 211. disk units of the disk unit group 2UA by the write after 

(3) Execution process by the control unit 203 between a process. The control unit 203 writes to the master disk unit 
disk unit 204 and the cache 206, whereby the control Ao the data of the write request received from the processor 
unit 203 executes a subsequent input/output process 30 210 and at the same time writes this write data to the cache 
independently of the input/output request received 206 as write data HI. The control unit 203 later executes 
from the processor 210. For example, a data transfer writing this same write data 111 to each of the disk units 
without any relationship to the processor 210, which is belonging to the same disk unit group 211A other than the 
an input output process between the cache 206 and a master disk Ao, that is to the disk units Al through Al with 
disk unit 204 in one of the disk unit groups 211. 35 selection (b) and write after process flow (c). The write data 

The present invention relates to a load distribution control 111 is stored in the cache 206 until all of the write after 

between the disk units 204 in the same disk unit group 211. processes to each of the disks Al through Ai are completed. 

Therefore, the input/output request process that does not In the case of receiving the write request which requires 

gain access to a disk unit group 211 referred to as (1) above access to the disk unit group 211A, the reason why the 

is not directly related to the present invention. Among the 40 master disk Ao is preferably always selected for immedi- 

processes executed by the control unit 203, the processes ately receiving the write data is as follows: If it is so 

mentioned above as (2) and (3) are the objects of the present arranged that any write request must necessarily be assigned 

invention. In this respect, the disk unit 204 to which no to the master disk AO, all write data 111 received from the 

input/output process (2) or (3) mentioned above is assigned processor 210 is written to the master disk Ao. As a result, 

(that is, a disk unit not executing any process), is referred to 45 the complete data is always held in the master disk Ao, even, 

as a disk unit in an inactive state. if for example, there is a breakdown in any one of the disk 

The load distribution control of the first type according to units Al through Ai other than the master disk and a power 

the present invention will now be described along with all outage affecting the cache 206. However, this arrangement 

features that are common to the load distribution control of results in a restriction in selecting the disk unit freely for thc 

the second type according to thc present invention. 50 write request received from the processor 210. Hence, the 

FIG. 1 is a block diagram illustrating the operation of the system performance is lowered as compared to a system 

control unit 203 in accordance with the first type of distri- wherein the write request can be allocated to an arbitrary 

bution of the present invention. In FIG. 1, there are a disk unit in the requested disk unit group without specifying 

plurality of master disk units, namely Ao, Bo and Co that are a master disk That is, the present invention has an advantage 

respectively a part of the disk unit groups 211A, 211B and 55 over such a system with respect to reliability, but has a 

211C. The difference between the master disk unit and the slightly reduced performance, e.g., speed. Specifically, in the 

other disk units is that the master disk unit is broadly a present invention when the control unit 203 receives the 

specific disk unit defined in advance in each of the disk unit write request 110 requiring access to the disk unit group 

groups 211 to more intensely receive the write requests, 211A, the control unit cannot start its processing unless the 

more specifically, to more intensely immediately receive the 60 master disk AO is inactive. 

write data directly in accordance with a write request with- FIG. 1 also illustrates the case where the control unit 203 

out passing the write data through the cache in a write after receives a read request from the processor 210 requiring 

process, and even more specifically the master disk imme- access to the disk unit group 2UC. At this time, the control 

diately receives the write data for all write requests to its unit 203 selects any one of the disk units 204 arbitrarily 
disk unit group, whereas the other diskunits of the same disk 65 (including an algorithm within the control unit), among the 
unit group receive the write data in a write after process from disk units 204 that are in an inactive state within the disk unit 

the cache. group 211C, which in the example of FIG. 1 involves the 
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wlection(e)of diskunit211C. The control unit203 transfers the disk unit group 2UA so that it can start ewoitine the 
T ^T*; 211 P to me »» d ** been received fromuVe?£Sr .to 
^ * C f * ^ * ^ 1,6 Ha 3 ' 1,46 ^ ^process (c) and the read ahead stage 
ZSf^^ff 1 ^^ 21 '" P"** 85 < a > 416 P crfonned ««* independently of the pro- 
ved not onty in the disk CI but also staged in the cache s cesser 210 and are each executed In parallel processing w£h 

broken iKSJEK^ J? " ^ " y 046 0th * Howevcr ' * are many inactiv^iSK 

^t^f"^, r gae ^ ta,aread . KqUeStf0rtheSame cooao1 ^ 203 «■ P 3 ^ the corresponding 

tSJUZZlfT* TE? *? m ~ e Cache 41 a ^ numbcr of ^ write after processes aVd read 

higher speed than it can be executed from the disk unit group ahead staging processes independently of the processor 21*. 

7^ . . __ , t 10 However, it is impossible to perform a write request that 

As shown in FIG. 1, there are input/output processes requires the master disk unit AO to be in an inactive state if 

between the disk unit groups and the cache 206 that are some other process, for example a read process, is already 

™„°? TJ l W ******* ° f *e being performed with reject to the master disk unit so that 

uiputfeutput request received from the processor 210. the master disk unit is not in an inactive state: this is a 
Specifically ^.ftcre is me wnte after prc<ess involving data 15 disadvantage of the first type of load distribution according 

flow (c) which writes wnte data to the disk units in an to theprcsent invention, which disadvantage is solvedbv the 

independent stage process independently executed from the second type of load distribution of the present invention as 

input/output request of the processor 210. Another example described hereinafter 

of the independent stage process performed separately from FIG. 4 and FIG. 5 illustrate parallel processing for plural 
the processor 210 is the execution of an advanced read from 20 read requests s ^j«ui<u 
the control unit 203 involving the inactive state disk unit FIG. 4 shows a plurality of processors 210, each con- 
selection process (e) and the advance read process (d). The nected to a single control unit 203, and specifically shown 
Tl 8 ^ 10 ^ 8 ^'^? t0W1 *te*«writedatalll are the processors 210 and 210a By waytf example, the 
storedu.ttecache206 to AediskunitselectedasAi, where control unit 203 receives from each of toe processors 2 W 
no write data 111 has been written yet. There is no need of 25 and 210a a read request that requires access to the disk unit 
performing a wnte after process with respect to the master group 211 A. Then the control unit 203 arbitrarily selects an 
disk AO in the disk unit group A because the write data Ul inactive disk unit for each, for example, disk unit AlandAi 
received from the processor 210 has already been directly among the disk units of disk unit group 211 A to start parallel 
wntten to the master disk unit Ao. Accordingly, with the processing the read requests that have been received, which 
exception of the master disk umtAo^e wnte after process 30 processing may involve staging (b)ofread data. As a matter 
£,c) is p^c^d sequentially with respect to each ofthe of course, if there is any inactive disk in the disk unit group 
other disk units, namely disk units Ai through Ai, with a 211A when the control unit 203 receives the read remie* 
sequence of «ecution not necessarily in that order. In the from the processor 210* the read request is immediately 
case where the cornel unit 203 executes a read ahead executed. If the master disk is inactive; the write request Z 
^g process (d, e) for a disk unit group 2UB indepen- 35 immediatdy executed. However, because of common 
Slv^^ "h I*' ^ C0n,rO - 203 to «>e master disk unit AO, it is impossible to 
I w , ^.TJ* m anlnaCllVe 543,6 a*"** P"f°nn a plurality of write requests, eaXof which 
all of the Ask un*s of the disk unit group 211B. In FIG. 1, requests the disk unit group 21LA. Also, if three or mvc 

t^XZl !■ iTt^ 1M , in fte 08(1,6 pl0CCSSaS " co™"** » fte ^ntto'l unttlo3, ft* 
206read from the disk Bj, byway of an example of the read <o possible to perform parallel three or more read request 

ahead staging process. respectively from the three or more processors where each 

Formewnterequestreccivedfromtbeprocessor210that request requires access to disk unit ^?2m so [ong^ 

Z^i^f « a * '? ft /™? s . ,to reUabait y * there are at least three or more inactive disk units among the 

obscured by selecting the master disk for immediate writing disk units 204 of disk unit group 211A 

l^tJ™ f*?' °. D *" 0thCr l ? aDd ' m 0,6 ^ of readin 6 45 FIG. 5 illustrates parallel processing with respect to one 

T^T'Ft k T/? 0U P' afl yf^ ve ^"^i»cluaing processor 210 connected to me control unit 2*11*5! 

{ U ^ t0tMB ^™ y ' m ^^.bywayofexa^le.canissueanewkpuJoJrS 

rehabflity and high performance are realized in a well request to the disk unit group 211A before the current 

b^ccd manner according to the first type of load distri- processing of the input/output request to the disk unitZI 

button of the present invention. 50 211A is completed. In FKJ. 5, by way of examplf the 

Thep^ntinveimonisparticiiUrlyappiicabletoparallel control unit 203 can be considered as in the middle of 

processing, as shown with respect to different parallel pro- executing with respect to disk unit A? a read ™t (a) 

cessing examples set forth in FIGS. 3 through 5. Also, all of received from the pressor 210 that rcq^« accS me 

™T2^7* D ™»t iS ?% Mypalaa * 1 disk unit group 21lTBefare nnishing^his ™dTq UC * (!) 

FIG. 3 illustrates the control unit 203 executing parallel 55 the control unit 203 receives another read request (b) from 

processing for the mput/output processes that arc: a first the processor 210 that involves access to \2?Z 

process tr^t requires access to the disk unit group Apursuant group 211A. The control unit 203 arbitrarily selects any one 

toa request from the processor 210; and a second process of me inactive disk units 204 of the disk LSffo^iuZ 

™ZiT^H° n FJ* C ° ntr ° 1 203 and * e CachC for ««*«« disk unitAi, to start this second recdved read 

f Jtt of * e ^ SS Z M *™ in FH3. «o request (b) before tie first read request (a) has bJ*££ 

f ^ 203 14 1X8X31161 P letd y execu,ed - Although not shown, the control un*21u 

™T*f a ^jj"*^ W with the disk unit Ai indepen- can also immediately start to process a write requesttua fe 

fullZT ^F^, Cd ^ ^ ^ OCeMOr ««*" me rMd (a) and (b) are'compS 

fud'ir^J ^^P^^^n decontrol unitandthe executed, whichwrite request requires access to thec^unft 

disk A2, and a read request (b) from the processor 210, all 65 group 211A only if the master disk unit Ao is macrive 

ofwh^chreq U1 reaecesstothediskunitgroup211A.m However.becaus^oftheeornpetitionLmern^^^ 

case, the control unit 203 selects an inactive disk unit Ai in it is impossible to paraUelexecute a pluS rf write 
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requests that each require access to the same disk unit group, 
for example disk unit group 2UA. 

Furthermore, even in the case where the single processor 
219 issues three ox more input/output requests including one 
write request all specifying the disk unit group 211A, the 
control unit 203 can parallel execute the requests if in each 
case there is an inactive disk unit, with the write request 
requiring the master disk unit to be inactive. 

Even though much of the following description of the first 
type of load distribution also relates to the second type of 
load distribution according to the present invention, the 
difference between the second type of load distribution and 
the first type of load distribution, each according to the 
present invention* wfll now be described. In the second type 
of distribution load, the read request from the processor 
requiring access to a specific disk unit group is executed 
with respect to any one of the inactive disk units other than 
the master disk unit. In a similar manner, a staging process 
that is performed independently of the processor 210 is 
executed with respect to any one of the inactive disk units 
other than the master disk unit 

. The reason why the second type of load distribution 
according to the present invention differs only in this manna 
from the first type of load distribution according to the 
present invention is that selecting a disk unit other than the 
master disk unit 204 for processes other than the write 
request allows the processing to be executed at a higher 
speed than with the first type of load distribution. This is 
possible, because then there will be less conflict between a 
write request and a process other than the write request, 
because the write request involves only the master disk unit 
(the write after can be performed at a later time) and the 
processing other than the write request is executed prefer- 
ably with respect to disk units other than the master disk 
unit, all of which increases the probability that parallel 
processing can be performed with less wait states. That is, 
according to the second type of load distribution according 
to the present invention, there is a higher possibility that the 
master disk is in an. inactive state when the write request 
requiring access to the disk unit group is received, which 
increases the possibility of parallel processing and increases 
the speed of the process. 

In FIG. 6, the following parallel processing is being 
accomplished as an example of the second type of load 
distribution according to the present invention. Write request 
110 from the processor 210 requiring disk unit group 211 A 45 
places write data 111 in the cache 206 of the control unit 203 
and also performs master disk selection (a) to thereby place 
the same write data in the master disk unit Ao. At the same 
time (including overlapping but not coincided execution 
cycles), according to parallel processing, a read request 113 
from the processor 210 requesting read data from the disk 
unit group 211C involves a selection (e) of disk unit Ci by 
the control unit 203 as an arbitrarily selected inactive disk 
unit among the disk units 204 other than the master disk unit 
Co to transfer read data to the processor 210 and as stage 
data 114 to the cache 206. Also according to parallel 
processing, staging conducting the read ahead process 
selects disk unit Bj by process (e) under control of control 
unit 203 to pass data along flow (d) to be stored in cache 206 
as stage data 114. In parallel with the above or at a later time, 
the control unit 203 can perform select (b) of disk unit Ai for 
the write after of write data 111. 

Unless otherwise indicated, the following description is 
applicable to both the first type and the second type of load 
distribution according to the present invention. 

FIG. 8 illustrates the structure of a disk unit 204. A 
plurality of rotatably driven coaxial disks 801 are provided 



in the illustrated disk unit 204. A read/write head 802 is 
provided for reading and writing data for each of the disks 
801 control unit interface 803 controls the operation, includ- 
ing movement, of the heads 802 with respect to the disks 
801. A unit of recording medium for each of the disks 801 
to which the read/write head 802 can gain access while the 
disk 801 completes one revolution is called a track 800. A 
plurality of tracks 800 are present on each disk 801. 

FIG. 9 illustrates the structure of a single track 800. The 
track 800 has its track head 902 and track tail 903 defined at 
certain fixed positions, as references. Also, one or more 
records 900 can reside on each track 800. A record 900 is a 
minimum unit of input/output processing between the pro- 
cessor 210 and the control unit 203. The position of the 
record 900 on the track 800 is determined by a unit of fixed 
byte length called a cell 901. The storage of a record 900 
must be started at a head of a cell 901 and it cannot be started 
from anywhere within the cell 901. Therefore the length of 
a record 900 is an integer multiple of the length of a cell 901. 
The niunbering of the cells 901 is in ascending order, one by 
one, beginning with the track head 902 of the track 900 as 
number 0. 

FIG. 10 illustrates the structure of the cache 206. The 
cache 206 may be DRAM or a portion thereof mapped to 
comprise segments 1000. In this embodiment, one segment 
1000 is assigned to one track 800, and the entire data in each 
track 800 is stored in a corresponding segment 1000. 
However, according to the present invention, the assigned 
unit of the segment 1000 is not necessarily limited to the 
entire track 800. A smaller unit, such as a record, which is 
a read/write unit between the processor 210 and the control 
unit 203, can also be adopted freely as the assigned unit. 

FIG. 11 illustrates the structure of the directory 208. The 
directory 208 comprises a plurality of segment management 
informations 1100, a track table 1101, and an empty segment 
35 head pointer 1102. Each segment management information 
1100 resides in the segment unit 1000. Each one of the track 
tables 1101 and empty segment pointer 1102 resides in the 
control unit 203. 
FIG. 12 shows the required information for the present 
40 invention, which is provided in each segment management 
information 1100. An empty segment pointer 1200 indicates 
the segment unit 1000 which is not used in the track 800. A 
cache track number 1201 is the number of the track 800 of 
the disk unit group 211 stored in the segment unit 1000 for 
the corresponding segment management information 1100. 
The record bit map 1202 shows the starting position of a 
record 900 on the track 800 stored in the segment 1000 for 
the corresponding segment management information 1100. 
Here the bit position is in the corresponding number of the 
starting cell 901. If, for example, the nth bit in the record bit 
map 1202 is on, the storing of the corresponding record 900 
is started at the nth cell 901, for the corresponding segment 
management information 1100. If the nth bit is off, a record 
900 stored starting at the nth cell 901 does not exist. 

FIG. 13 illustrates the storing format of the data on the 
track 800 in the disk unit 204 for data also in the cache 206. 
The structure shown in FIG. 13a is the same as that of FIG. 
9 that has already been described and which is also con- 
tained within the segment unit 1000. In the segment unit 
1000, shown in FIG. 13fe, the records 900 are sequentially 
recorded starting from the record at track head 902 on the 
track 800. Therefore, if the number of the cell 901 that stores 
the start of the record 900 on the track 800 is known, the 
storage starting position of the record 900 in the segment 
unit 1000 of the cache 206 is also known. 

With respect to FIG. 12, partially described above, an 
updated record bit map 1203 for a record 900 is stored in the 
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segment unit 1000 for the corresponding segment manage- Effective information is stored from the first disk unit 
ment information unit 1000, which requires a write after information head 1601 up to the number of the disk unit 
process. The record 900 that requires the write after process information 1601 defined by the disk unit number 1600. 
is hereinafter called a write after record. The respective bits Here the disk unit information head 1601 is information for 
reside in the corresponding number of the cell 901 as in the 5 the master disk. Also, the nth updated record bit map 1203 
case of the record bit map 1202. Specifically, if the nth bit of FIG. 12 in the segment management information 1200 is 
in the updated record bit map 1203 is on, the record 900, a disk unit 204 corresponding to the nth disk unit informa- 
storing of which is started at the nth cell 901 for the tion 1601. A processor input/output wait bit 1602 shows that 
corresponding segment management information 1100, is a an input/output request received by the corresponding disk 
write after record. A separate update record bit map 1203 is 10 unit group 211 from the processor is in the wait state. This 
r*ovidcd for each one of the disk units 204. The specific bit number can be expressed as foUows. The number of the 
relationship between an updated record bit map 1203 and a processor input/output wait bit 1602 equals the number of 
disk unit 204 will be referred to when the structure of the the processor 210 that can be connected to the control unit 
control information memory 207 is described. The areas for 203 (here the number is given as I) times the number of the 
the updated record bit map 1203 are prepared for the number 15 input/output process requests, (here the number is given as 
of the disk units 204 that can be defined in one disk unit J) that can be executed in parallel by one processor 210 for 
group 211. However, the number of the updated record bit one disk unit group 211. 

maps 1203 that can be used is the number of the disk units Therefore, when each processor 210 issues the input/ 
204 comprising the corresponding disk unit group. output request to the control unit 203, the processor 210 

In FIG. 12. the store completion flag 1204 shows whether 20 sends the following two points of information to the control 
or not the record 900 is stored on the assigned track 800 of unit 203. A first point of information is the identity of the 
the disk unit with respect to the record in the segment unit processor 210 that issues the input/output request, indicated 
1000 for the corresponding segment management informa- by one of the numbers from 1 to I that are respectively 
tion 1100. The active flag 1205 shows that the input output assigned to the processor 210. The second point of infor- 
process is being executed for the track 800 assigned to the 25 mation identifies for the specified disk unit group 211 the 
corresponding segment management information 1100. The input/output requests by one of the numbers 1 to J. 
segment pointer 1206 indicates a segment unit 1000 for the FIG. 17 illustrates the structure of one of the disk unit 
corresponding segment management information 1100. informations 1601 and the others have the same structure. A 

FIG. 14a illustrates the structure of the track table U01, disk unit number 1700 is given for identifying the disk unit 
and FIG. 146 indicates the structure of the inactive segment 30 204 far the corresponding disk unit information 160L A 
head pointer 1102. processor input/output execution single bit 1701 shows 

The track table 1101 shows whether or not each of the whether or not a disk unit 204 for the corresponding disk 
segment units 1000 is assigned to a track 800, for a set of unit number 1700 is active in executing an input/output 
tracks 800 in the same disk unit group 211. A track table request received from the processor 210. A write after 
1101 is provided for each of the disk unit groups 211. If 35 execution single bit 1702 shows whether or not a disk unit 
assigned, a pointer 1100A is set to provide the address in 204 for the corresponding disk unit number 1700 is active in 
memory where the segment management information is to executing a write after process. An independent staging 
be found. As shown for each of the segment units 1000 execution single bit 1703 shows whether or not a disk unit 
assigned to the track 800 if there is no segment management 204 for the corresponding disk unit number 1700 is active in 
information 1100, 1200, the pointer is reset, The track table 40 executing a staging process performed independently of the 
1101 has the information regarding tracks 800 in the same processor 210. For the processor input/output execution bit 
disk unit group 211, all stored in the numerical order of the 1701, write after execution bit 1702 and independent stage 
tracks 800, that is in ascending numerical order in the execution bit 1703, only one may be set on at a single time 
direction of the arrow shown in FIG. 14(a). The segment Also, a disk unit 204 for which the processor input/output 
management information 1160 for the corresponding seg- 45 execution bit 1701, write after execution bit 1702 and 
ment unit 1000 to which no track 800 has been assigned is independent staging execution bit 1703 are off is a disk unit 
all combined sequentially in storage at an address identified 204 in an inactive state. When one of these bits is on, the 
by the empty segment head pointer 1102. A set of the disk unit 204 is in an active state. Segment management 
unassigned combined segment management information information pointer 1704 indicates the address of the stored 
1100 is called an empty segment que 1400 shown in FIG. 50 segment management information 1100 assigned to the track 

800 accessed by an input/output process in execution by a 

FIG. 15 illustrates the structure of the control information disk unit 204 for the corresponding disk unit 1700. The 
memory 207. In the control information memory 207, disk segment management information pointer, when set, shows 
unit group information 1500 is included, which has infor- the address in storage for the segment management infor- 
mation for each disk unit group 211. The number of disk unit 55 mation for the disk unit 204 identified by disk unit number 
group informations 1500 corresponds to the number of the 1700. 

disk unit groups 211 that can be controlled by one control It is desirable that the control information memory 207 is 
UXS ^? 3 ' ... non-volatile, otherwise there is a problem that information 

KG. 16 illustrates the structure of one disk unit group stored in the control information memory 207 can be lost 
information 1500, which is the same for all. Disk unit 60 due to power outage and other reasons, 
number 1600 is the number of the disk units 204 currently Input/output processes to be executed by the control unit 
in the corresponding disk unit group 211. A plurality of the 203 are actually carried out in parallel by the respective 
disk unit informations 1601 are provided, respectively for directors 204 in the control unit 203. 
each of the disk units 204 comprising the corresponding disk FIG. 18 shows each procedure used by each of the 
unit group information 1500. The prepared number of disk 65 respective directors 204 for carrying out the required parallel 
unit informations 1601 is equal to the definable number of executions according to the present invention. Each function 
the disk units 204 defined in one disk unit group 21L of the procedures will be described. An input/output request 
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receipt part 1800 processes the input/output request received unit group 211, that is the head disk unit information 1601 

from the processor 210. A write after process schedule part in the disk unit group information 1500. In other words, the 

1801 provides a schedule for the write after process. An processor input/output execution bit 1701, write after execu- 

independent stage process schedule part 1802 provides a tion bit 1702, and independent staging execution bit 1703 

schedule for the staging performed independently of the s are all checked to determine if they are all inactive. Then, if 

processor 210. A disk unit transfer part 1803 executes the the master disk is found to be in the inactive state, corre- 

read/write transfer to and from the disk units 204. sponding to none of the above-mentioned bits being set 

FIG. 19 is a flowchart for the input/output receipt part active, the requested master disk is selected according to 

1800 of FIG. 18. When the input/output receive part 1800 step 1907. The selection is made in step 1907 by turning on 

receives a new input/output request from the processor 210, to the processor input/output execution bit 1701 in the disk unit 

this part starts its execution. Hie execution is as follows. information 1601 for the corresponding master disk When 

At step 1900, it is determined if an input/output request the above process is completed, processing proceeds to step 

that has been received requires access to the disk unit 204. 1910 to start the same processing that is already described 

To specifically define the type of an input/output request with respect to the first type of load distribution, 

which requires access to the disk unit 204 is not directly 15 If the determination in step 1906 is that the requested 

related to the present invention and therefore is omitted from master disk is active, processing proceeds to step 1913 

the detailed description. If the input/output request received where the corresponding input/output request is then kept in 

does not require access to the disk unit 204. the processing a wait state until it can be executed by turning off the active 

proceeds to the step 1918. If the answer to the determination flag 1205. 

in step 1900 is yes, step 1901 starts executing the input/ 20 If there is a negative determination from step 1905, that 

output request by first determining if the requested input/ is if there is a read request, processing proceeds to step 1908. 

output track resides in the cache. Spedfically, a track 800 In step 1908, a determination is made if there is any inactive 

which the input/output request wishes to gain access to, is disk unit found in the requested disk unit group 211, to start 

checked to determine whether or not a segment 1000 is the assignment of the read request which requires access to 

assigned thereto. If an assignment is made, processing 25 a disk unit 204 of the requested disk unit group 211. 

proceeds to step 1903. If no assignment is made as deter- According to the first load distribution type of the present 

mined by step 1901, step 1902 assigns segment management invention, an arbitrary disk unit which is in the inactive state 

information 1100 to the track 800 to which the input/output is assigned to the request which requires access to a disk unit 

request wishes to gain access and links it to the correspond- 204. Accordingly, the requested disk unit group 211 is 

ing area of the track table 1101. Also, the store completion so checked to determine if there is an empty disk unit 204 

flag 1204 for the assigned segment management information therein in accordance with step 1908. The specific contents 

1100 is turned off and the active flag 1205 is turned on. At of the processing are as follows. In other words, with respect 

this time, the segment management information 1100 of the to the disk unit informations 1601 for the requested disk unit 

assignment is one selected from the segment management groups 211, the processor input/output execution bit 1701, 

information's 1100 that arc in an inactive state starting at the 35 write after execution bit 1702 and independent stage execu- 

empty segment head que 1102. If there is no segment tion bit 1703 are searched to see if they are off. If any one 

management information 1100 in the inactive state, a seg- of these bits is on for each of the searched disk units, it 

ment management information currently assigned is selected means that there is no inactive disk unit 204 in the requested 

by a known method. Any specific method of the selection is disk unit groups 211, making it impossible to start executing 

not related to the present invention and thus will not be 40 the read process and therefore the answer to the determina- 

described in detail Following step 1902, step 1905 is tion in step 1908 is no and processing proceeds to step 1913 

executed. for the wait state until processing may be resumed. 

If the determination in step 1901 is yes, step 1903 makes If an inactive disk unit 204 is found in the requested disk 

the determination if the requested input/output track is in unit group 211, corresponding to a yes determination from 

use, that is the active flag 1205 for the segment management 45 step 1908, an inactive disk unit 204 of the corresponding 

information 1100 assigned to the track 800 to which the disk unit number 1700 in the corresponding disk unit 

input/output request gains access is checked to determine information 1601 is selected by step 1909. Specifically, the 

whether or not this flag 1205 is on. If it is on, the requested processor input/output execution bit 1701 in the selected 

track 800 is in use for some other input/output process, disk unit information 1601 thus found is turned on. 

therefore, the input/output request newly received cannot be so In step 1910, the selected segment management informa- 

executed immediately and hence processing proceeds to step tion pointer 1704 is set to indicate the segment management 

1916. If according to the determination in step 1903, it is information 1100 assigned to the requested track 800. In step 

found that the active flag 1205 is off, the active flag 1205 is 1911, a positioning process request is issued for the disk unit 

turned on according to step 1904 and processing proceeds to 204 selected according to step 1909 to access the requested 

step 1905. 55 track 800 with its head. In step 1912, connection between the 

In step 1906, it is determined if the input/output request director 205 in use and the processor 210 making the request 

is a write request According to the most preferred form of is cut off until the position process for the selected disk unit 

the present invention, a write request which requires access 204 is completed. Thereafter, the processing in the input 

to the disk unit group 211 gains access to only the master receipt part 1800 is terminated. 

disk unit Thus, if the input/output request is for reading, so As mentioned; step 1913 is conducted when no inactive 

e.g., the processing branches conditionally to step 1908 in disk unit 204 is found in the disk unit group 211 requested, 

accordance with a negative determination from step 1905. In step 1913, the active flag 1205 from the corresponding 

In the case of a write request, the master disk unit is segment management information 1100 is turned off. In 

checked at step 1906 to determine if the master disk unit in accordance with step 1914, a determination is made if the 

the requested disk unit group is inactive. This check exam- 65 store completion flag 1204 is on. If it is on, processing 

ines the following information in the disk unit information proceeds to step 1916. If the determination in step 1914 is 

1601 for the corresponding master disk of the requested disk no, the segment management information 1100 is registered 
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in step 1915 in an empty segment que 1400 because the off turns off the write after execution bit 1702 and the process- 
value of the store completion flag 1205 indicates that no data ing of the write after process schedule part 1801 is termi- 
is recorded in the segment 1000 corresponding to this nated. If the answer to the determination of step 2003 is yes, 
segment management information 1100. step 2005 will select the track 800 for the write after process. 

In step 1916, the corresponding input/output wait bit 1602 5 If there are a plurality of tracks 800 which can be used for 

in the disk unit group information 1500 is set to provide an executing the write after process, one must be selected, 

indication to the processor 210 that the process of the However, the selection of the track 800 among a plurality of 

corresponding input/output request cannot be started due to such tracks is not related to the present invention and its 

some other input/output process being carried out. description is omitted. 

Specifically, the bit position to be set in the process request 10 In step 2006, the active flag 1205 in the segment man- 
input/output wait bit 1602 is determined in view of the two agement information 1100 for the track 800 selected at the 
points given below, and the setting of the bit is performed step 2005 is turned on. At step 2007, pointer 1704 to the 
accordingly. A first point is to know the number of the segment management information 1100 assigned to the 
processor 210 which issued the corresponding input/output selected track 800 for the corresponding input/output request 
request from those processors 210 numbered from 1 to L A 15 is set At step 2008, a positioning process request is issued 
second point is to know subsequently the number of the to the disk unit 204 selected at the step 2001. After this, the 
input/output request from those numbered 1 to J issued to the processing of the write after process schedule 1801 is 
disk unit group 210 specified by the input/output request terminated. 

In the case where the segment management information FIG. 21 is a flowchart showing the independent stage 

1100 of the requested track 800, is being used by some other 20 process schedule part 1802, which executes during the time 

input/output request, it is not particularly necessary to mat me Sector 205 is in an inactive state. In step 2100, the 

manipulate the segment management information 1100. disk unit group 211 which executes a stage process inde- 

HnaUy, at step 1917, merequestfo^ pendently of the processor 210 is defined. This step is not 

mat me processing of me corresponding input/output request directly related to the present invention and accordingly a 

is in a wait state because the execution thereof cannot be 25 specific description is omitted. In step 2101, a determination 

started due to some other input/output process being is made if the disk unit group 211 found in step 2100 has a 

executed. After this, the processing in the input/output track 800 fear executing the staging independently of the 

receipt part 1800 is terminated. processor 210, This step is not directly related to the present 

If the answer to step 1900 is no, processing proceeds to invention and therefore its specific description is omitted. If 

step 1918. In step 1918, there is execution of a process 30 no track 800 is found in step 2101, the processing in the 

required for the input/output request which does not require independent staging schedule part 1802 is terminated. If the 

any access to the disk unit 204. The specific contents of this answer to the determination of step 2101 is yes, step 2102 

process is not directly related to the present invention is performed. In step 2102, a track 800 is selected for the 

therefore it is not described further in detail. Thereafter, the execution of the staging performed independently of the 

processing is ended. 35 processor 210. If there are a plurality of tracks 800 that can 

FIG. 20 is a flowchart showing the write after process be used to execute the stage process independently of the 

schedule part 1801. The right after process schedule part processor 210, a track 800 from among the plurality must be 

1801 executes during a time the director 205 is inactive. selected. However, the selection of a specific track 800 itself 

As shown in FIG. 20, step 2000 defines the disk unit is not related to the present invention so that the specific 

group 210 for the write after. Because the method of this step 40 description thereof is omitted. 

is not directly related to the present invention, the detailed In step 2103, a segment management information 1100 is 

description thereof is omitted. In step 2001, a determination assigned to the track 800 selected in the step 2102. The track 

is made if there is an inactive disk unit 204 other than the 800 which should be used for executing the staging inde- 

master disk unit found in the requested disk unit group 211. pendently of the processor 210 is not a track 800 staged in 

The specific contents of the process of this step 2001 is 45 the cache 206. This assignment method is the same as shown 

given. With the exception of the master disk, a disk unit and described previously with respect to step 1902. Also, the 

information 1601 in which the processor input/output execu- store completion flag 1204 in the assigned segment man- 

tion bit 1701 , write after execution bit 1702 and independent agement information 1100 is turned off; the active flag 1205 

stage execution bit 1703 are all off is searched for. If it is turned on. 

cannot be found, the negative result of seep 2001 indicates 50 In step 2104, it is determined if there is a disk unit that is 
that the write after process cannot be executed and process- in the inactive state in the disk unit group defined in the step 
ing proceeds to the end and the right after process schedule 2100 for the input/output processing. The specific process- 
part 1801 is terminated. If found, that is if there is an ing for step 2104 is the same as that of step 1908 and hence 
atlrmative answer to step 2001, the write after execution bit a specific description thereof will be omitted If the detcr- 
1702 in the disk unit information 1601 found with step 2001 55 mination of step 2104 is negative, that is if no inactive state 
is turned on with step 2002. disk unit is found, it is impossible to execute the staging 
In step 2003, a detennination is made if the disk unit 204 independently of the processor 210 and as a result the 
found in the step 2001 has any track 800 which can execute segment management information 1100 assigned at step 
the write after process. The specific check information is the 2103 is returned to the empty segment que 1400 and 
segment management information 1100 with the on-bit in 60 thereafter the processing in the independent stage process 
the updated record map 1203 for disk unit 204 selected from schedule 1802 is terminated. If the answer to the determi- 
the track table 1101. Also, it is necessary that such segment nation in step 2104 is yes, that is if an inactive disk unit is 
management information 1100 is not in use for some other found, step 2106 is performed to turn on the independent 
process request Thus, the active flag 1205 in the segment stage execution bit 1702 in the disk information 1601 found 
management information 1100 must be off. This is another 65 at the step 2103. 

condition required to execute the write after process. If the In step 2107, a pointer 1704 to the segment management 

answer to the determination in step 2003 is no, step 2004 information 1100 assigned to the selected track 800 by the 



06/02/2003, EAST Version: 1.03.0002 



5,680,574 

19 20 

corresponding input/output request is set. In step 2108 a case, it is also necessary to execute the following process 

positioning process request is issued to the disk unit 204 while identifying the number of the cell 901 in the stage 

selected in step 2100. Thereafter, the processing in the object record 990. First, the record 900 to be staged is also 

independent staging schedule 1802 is terminated. staged at a position corresponding to the identified cell 901 

FIG. 22 is a flowchart for the disk unit read/write transfer 5 in the segment 1000. Also, the bit position of the record bit 

part 1803 of FIG. 18. Execution by the disk read/write unit map 1202 corresponding to the number of the identified cell 

transfer part 1803 is started when the director 205 is 901 is turned on. Then, with the storing complfition flag 

informed that the positioning of the disk unit 204 is com- 1*204 turned on in step 2208, the processing proceeds to step 

pleted. 2215. 

In step 2200, a segment management information 1100 10 Step 2209 is reached from a no determination in step 
pointed to by the segment management information pointer 2203. In step 2209, a determination is made as a part of a 
1704 in the disk unit information 1601 for the corresponding read request received from the processor 210. In step 2209, 
disk unit 244 is selected. Hereinafter, a simple description of it is determined if the store completion flag 1204 in the 
the segment management information 1100 means the seg- subject segment management information 1100 is on. If the 
ment management information U00 selected at the step 15 determination in step 2209 is yes, processing proceeds to 
2200 unless otherwise specified. Also, a simple description step 2210 to transfer the requested record 900 to the pro- 
of the information in the segment management information cessor 210. The record 900 has already been stored in the 
1100 means the information in the segment management segment 1000. Consequently, in step 2210 the requested 
information 1100 selected at the step 2200 unless otherwise record 900 in step 2210 is transferred from the disk unit 204 
specified. 20 to the processor 210. Then the completion of the input/. 

In step 2201, a determination is made if the processor output process is communicated to the processor 210 and 

input/output execution bit 1701 in the disk information 1601 processing proceeds to step 2215. 

for the corresponding disk unit 204 is on. If the bit is not on, JS the ddermination in step 2209 is negative, that is if the 

it indicates that the input/output process currently in execu- store completion flag 1204 is off, the record 900 on the 
tion is not a process for the input/output request received 25 subject track 800 currently in execution is not staged in the 

from the processor 210 and processing proceeds to step segment 1000. Therefore, process step 2211 is executed. At 

221Z If the deterimnation in step 2201 is yes, that is if the step 2211, while being transferred to the processor 210 from 

execution bit 1701 is on, it indicates that the input/output the disk unit 204, the requested record 900 is staged in the 

process currently in execution is the process for the inpuV segment 1000 of cache 206. Also, in step 2211, it is 
output request received from the processor 210. 30 necessary to execute the process given below while identi- 

Accordingly, in step 2202, the completion of positioning is tying the number of a cell 901 of the stage record 900. Hrst, 

communicated to the processor 210 and processing contin- the record 900 to be staged in a segment 1000 is also written 

ues with connection of the control unit 203 to the processor. at a position corresponding to the identified cell 901. 

In step 2203, a determination is made if the input/output Further, the bit for the corresponding number of the identi- 

request received from the processor is a write request If the 35 fied cell 901 in the record bit map 1202 in the selected 

determination is no, that is, in the case of a read request, segment management information 1100 is turned on. After 

processing proceeds to step 2209. this the completion of the input/output process is commu- 

In the case of a write request, the data is received from the nicated in the processor 210. Subsequently, processing pro- 
processor 210 in step 2204 and written to the disk unit 204 ceeds to step 2207, which has been described above, to stage 
and in the segment 1000 assigned to the corresponding 40 the remaining record 900 in the subject track 800. 
segment management information 1100. However, the num- With the negative determination from step 2201, step 
ber of the cell 901 where the data in the request 1000 is to 2212 is reached. In step 2212, it is determined if the write 
be written should be identifted before the write execution after execution bit 1702 in the disk unit information 1601 for 
described above starts, because the data to be written in the the corresponding disk unit 204 is on. If the answer is no, 
segment 1000 must be written at a position corresponding to 45 that is if the bit is off, the processing proceeds to step 1214* 
the cell 901 thus identified. Further, the updated record bit to execute the stage process independently of the processor 
map 1203 for all the disk units 204 other than the master disk 210. In step 2214, all the records on the track are staged to 
unit in a segment management information 1100 is selected the cache and the corresponding record bit map is set and the 
In other words, the corresponding bit to the cell 901 iden- storage completion flag is turned on. 
tified as above in the updated record bit map 1203 is turned 50 If the detennination in step 2212 is yes, step 2213 will 
on. Hie write data received from the processor is also write the write after record to the disk unit All the identified 
thereby written to the cache; thereafter, the completion of the write after records are identified by the subject bit map 1203 
input/output process is communicated to the processor 210, in the defined segment management information 1100. After 

In step 2205, a determination is made if the storing this, the entire updated record bit map 1203 for the corre- 

compleuon flag 1204 is on, by checking the corresponding 55 sponding disk unit 204 is cleared, that is set to 0. 

segment management information 1100. If the detennination Subsequently, the processing proceeds to step 2215. 

is yes, processing proceeds to step 2215 because the records In step 2214 reached with a no answer from steps 2212, 

900 on the processed track 800 are staged in the segment the stage process is executed independently of the processor 

1000. If the answer is no indicating that the storing comple- 210. Specifically, all the records 900 on the subject track 800 

tion flag 1204 is off, the record 900 from the process track 60 are staged in the segment 1000 of the cache. It is also 

800 is not staged in the segment 1000. Consequently, the necessary to execute the following process while identifying 

process proceeds to step 2206. the number of the cell 901 in the stage record 900. First the 

In step 2206, the bit position of the record bit map 1202 record 900 to be staged in the segment 1000 is also written 

corresponding to the number of the cell 901 identified instep at a position corresponding to the identified cell 901. 

2204 is turned on. 65 Further, the following process is performed for the record bit 

In step 2207, the remaining record 900 and the track 800 map 1202 in the subject segment management information 

currently in execution to the segment 1000 is staged. In this 1100. In other words, the bit position of the record bit map 
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1202 corresponding to the identified cell 901 is turned on. In a disk unit 204 other than the master disk. Specifically the 

addition, the store completion flat 1204 is turned on. following information in the disk unit information 1601 for 

The texnunauon process proceeds from step 2215. In step each disk unit 204 other than the master disk is checked. In 

2215, reached from various other steps 2214, 2213, a yes other words, the processor input/output execution bit 1701 

answer fo>m step 2205, and step 2208. m step 2215, the 5 write after execution bit 1702 and independent stage execu- 

acuye ^1205 in the subject segment management infor- tion bit 1703 are checked to determine whether all the bits 

"T 01 ! ? * • ^ m * for mose Kit units other than the master disk unit 

In step 2216 all the bits in the process I/O execution bit If there is an inactive disk unit 204, a yes determination from 

1701, wnte after execution bits 1702 and independent stage step 2300, processing proceeds to step 1909 to select such 

execution bit 1703 in the disk unit information 1601 for the 10 disk unit 204, and then the same process as described with 

corresponding disk unit 204 are turned off, which will show respect to FIG. 19 proceeds. If there is no inactive disk unit 

the inactive state for the disk unit 204. 204, that is if the answer to the determination of step 2300 

Finally, in step 2217, the following process is executed to is no, the processing proceeds to step 1906 to check whether 

release the wait state of the input/output request for which the master disk is inactive, and the processing follows the 

the processor wait bit 1602 for the corresponding disk unit 15 process flow already shown and described with respect to 

group 211 is on. In other wards, the waiting states of all of HG. 19. 

the input/output requests defined by the processor 210 With the exception of the points given above, the process 

numbered 1 to I and the inputfoutput request numbers of 1 of FIG. 23 is exactly the same as that shown in FIG 19 and 

to J according to the bits being turned on are released. the description thereof is omitted 

Specifically, each of the processors is notified to issue its 20 FIG. 7 is the flowchart of the independent stage process 

input/output request. In summary, a search is made against schedule part 1802 with respect to the second type of load 

theinput/outoutwaitbitof me input/output request which is distribution of the present invention. The execution of the 

in a wait state for the corresponding disk unit group and the independent stage process schedule part 1802 for the second 

wait stateis released. After this, the processing in the disk type of load distribution is started the same as in the case of 

umt transfix part 1802 is terminated. 25 the first type of load distribution, already described 

As mentioned previously, the second type of load distri- Subsequently, the difference between the process flow 

buhon accOTOing to the present invention is the same as the shown in FIG. 21 with respect to the first distribution load 

above-described first type, except for specifically mentioned and the process flow shown in FIG. 7 will be described In 

differences. Some of these differences are given below. In this respect, the step numbers are identical where the con- 

the second type of load distribution, according to the present 30 tents of processes and the process flow of FIG 7 are exactly 

invention, a disk unit 204 other than the master disk is the same as those of the process flow in FIG. 21 

preferably selected for the read request requiring access to The process flow in FIG. 7 differs from that in FIG 21 in 

the disk unit group 211 and the staging is performed indc- the following points. First, when an inactive disk unit 204 is 

pendently ^of the processor 210. The respective data structure to be found subsequently to the step 2102, an inactive disk 

*^ m J ^*?^ Wghl7 ^ e ^^ tomcswondty ^ 35 ^204omermanmernasterdiskisfoundmmestep24^^ 

of load distabution without any change even though they in the case of the second type of load distribution according 

were described with respect to the first type of load distri- to the present invention. This is because of the preferred 

button. The modular structure shown in HG; 18 that is selection of the disk unit 204 other than the master disk unit 

necessary to executing the first type load distribution in in the second type of load distribution for the stage process 
arec** 205 <^ be adapted for the second type of load 40 performed independently of the processor 210. The specific 

(Hstnbuuon as it is. Although the respective process flow of process is the same as that for step 2300 and the description 

the modules in the input/output request receive part 1800 thereof is omitted. If there is an inactive disk 204 the 

and the maependeot stage process schedule part 1802 are processing proceeds to step 2104 to select such inactive disk 

^ghtiy d^crent from those in the first type of load unit 204, and the same process as in the first type of load 

attribution the process flow in the first type of load 45 distribution starts. If the disk unit 204 other than me master 

distortion for the other modular process is applicable disk unit is not in an inactive state, the master disk unit is 

wiftout modification to the second type of load distribution. checked in step 2401 to determine if it is inactive. The above 

FIG. 23 is a flowchart showing the input/output request process is the same as the process in step 1906 and therefore 

receive part 1800 in the second type of load distribution of the description thereof is omitted. If the master disk unit is 

the present invention. The execution of the input/output 50 inactive, the master disk unit is selected in step 2402 

request receive part 1800 is started as in the case of the Specifically, the processor input/output execution bit 1701 in 

previously desenbed first type of load distribution. the disk unit information 1601 for the carrcspondinc master 

^erefore oiuy the difference between the flowchart of FIG. disk is turned on. After this, the processing proceeds to step 

19 of the first load distribution type and the flowchart of FIG. 2107 to start the same processing as already explained with 

^^^^^^^^^^^^^^^and 55 respect to the first type of load distribution. If the masterdisk 

the smuianties will not be repeated In this respect, the step unit is not inactive, that is if it is active, the staging process 

^"I^f* 1 dena ^ wbCTCtne stents of theprocess steps performed independently of the processor 210 cannot be 

£ mr if C -r? aC ^ the SamC ? 0SC m ° *?° CCSS s** 5 executeA Hew*, to processing proceeds to step 2105 to 

« ?U ;™ C ^ C ?!! 0C m i* 00 " 5 flow of toc start me same P^css as already explained with respect to 

flowchart of ^ 23^^^ a me &st type of load distribution.^ 

is adopted in FIG. 23 instead of the step 1908 in FIG. 19. above points, the process flow in FIG. 7 and foe pro^ssflow 

In step 2300, a determination is made if any inactive disk in FIG. 21 are exactly the same, and the description of these 

unit other than the master disk is found In step 2300, the same parts will not be duplicated 

selection is preferably made to Pennine if the disk unit 204 According to the present invention, it is possible to obtain 

other than the master disk is in an inactive state. This is 65 a well balanced high performance and high reliability of a 

because the second type of load distribution, for a read control unit with a cache having a function of writing the 

request requiring access to the disk unit group 211, selects same data into all of the plurality of separate storages of a 
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disk unit group composing one or more disk units. This is 
because of the achievement of the distribution of an input/ 
output process between the disk units within a limit not 
losing any reliability and the improvement of the perfor- 
mance of an input/output process executable by the control 
unit brought about by the present invention. 

While a preferred embodiment has been set forth along 
with modifications and variations to show specific advanta- 
geous details of the present invention, further embodiments, 
modifications and variations are contemplated within the 
broader aspects of the present invention, all as set forth by 
the spirit and scope of the following claims. 

We claim: 

1. A control unit for processing input/output processes for 
a disk unit group of a master disk unit and other disk units, 
the disk unit group being operable under the control of a 
processor issuing read/write requests to the control unit for 
the disk unit group, said control unit comprising: 

write means for processing a write request from the 
processor to the disk unit group for always writing 
write data of any write request from the processor to the 
master disk unit of the disk unit group; 

after write means for thereafter writing the same write 
data to all the other disk units of the disk unit group; 

read means for processing a read request from the pro- 
cessor to the disk unit group for reading read data from 
any one of the disk units of the disk unit group that is 
selected by the control unit, and transferring the read 
data to the processor; and 

wherein each of said after write means and said read 
means examines an active/inactive state of the disk 
units and selects only an inactive one of the other disk 
units for the reading and transferring as a first priority 
and upon deterniining the active state for all the other 
disk units, then examines an active/inactive state of the 
master disk unit to complete the reading and transfer- 
ring with respect to the master disk unit when the 
master disk is in an inactive state as a second priority 
and to place the reading and transferring in a wait state 
when the master disk unit is in an active state. 

2. The control unit of claim 1, including: 
a cache; and 

staging means for stage processing reading and writing 
between the cache and any disk unit of the disk unit 
group independently of the processor. 

3. The control unit of claim 2, wherein said staging means 
includes read ahead means for performing a read ahead 
process transferring read data from a disk unit of said disk 
unit group to the cache independently of a current I/O 
request and in parallel with the writing, reading and trans- 
ferring of said write, after write and read means. 

4. The control unit of claim 1, including means for parallel 
processing the writing, reading and transferring of said 
write, after write and read means. 

5. The control unit of claim 1, wherein each of said means 
examines an active/inactive state of the disk units and selects 
only an inactive state disk unit for respective writing, 
reading and transferring. 

6. The control unit of claim 1, in combination with a 
plurality of the disk unit groups, with each disk unit group 
having a master disk and at least one other disk unit; and 

said write, after write and read means being inoperative 
for parallel processing with respect to each disk unit 
group. 

7. The control unit of claim 6, wherein each disk unit 
group has a plurality of other disk units. 
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8. The control unit of claim 1, in combination with a 
plurality of processors, all of said write, after write and read 
means parallel processing read/write requests from the pro- 
cessors. 

9. The control unit of claim 1, wherein each disk unit 
includes a plurality of rotatably mounted coaxial disks, at 
least one transducing head associated with each of the disks 
and movable relative to the disks for transducing the asso- 
ciated disk under control of said write, after write and read 
means. 

10. The control unit of claim 1, wherein said after write 
means includes a cache memory, writes the write data in the 
cache memory, reports completion of the write request to the 
processor, and thereafter writes the write data to the disk 
units other than the master disk unit 

11. A controller method for control of storage with respect 
to at least one storage group having plural storage units, 
comprising: 

selecting one of the storage units for a write request for 
the storage group received from a processor and com- 
pleting the write request with respect to the one of the 
storage units; and 
selecting a different one of the storage units with respect 
to said one storage unit as a read unit for a read request 
made to the storage group from the processor with said 
second mentioned selecting being based upon criteria 
that the read unit is arbitrary with respect to the read 
request and is an inactive storage unit, and completing 
the read request with respect to the read unit so that said 
30 second mentioned selecting and completing of the read 
request are parallel processed with respect to said first 
mentioned selecting and completing of the write 
request 

12. The method of claim 11, wherein said step of selecting 
35 for the read request has the further criteria that the selection 

is only made among the storage units in the storage group 
other than a specific storage unit having a fixed location 
within the storage group. 

13. A method for load distribution control of storage with 
respect to at least one storage group having a plurality of 
storage units, comprising: 

selecting a specific storage unit having a fixed location 
within the storage group for all write requests to the one 
storage group, based upon a selection criteria that the 
specific storage unit is inactive; 
completing the write request with respect to the selected 
specific storage unit by writing write data to the 
selected specific storage unit; 
reporting completion of the write request and thereafter 
writing the same write data to storage units of the one 
storage group other than the specific storage unit; 
selecting an arbitrary storage unit for read requests to the 
one storage group, with said selecting being indepen- 
dent of the read request and based upon criteria that the 
selected arbitrary storage unit is inactive and a part of 
the storage group identified by the read request from 
the processor, and completing the read request with 
respect to the selected arbitrary storage unit; and 
wherein said step of selecting for the read request further 
requires that the selection be made only among the 
storage units in the storage group other than the specific 
storage unit 

14. A method according to claim 13, including selecting 
the specific storage unit based upon satisfaction of the 
criteria that the selected specific storage unit is inactive and 
selected only after all of the other storage units have been 
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™ft£!£r ,!W 'VS d l ? dto "." f* 386 Unit ' predetermine! one disk unit in response to a read 

and thereafter completing the read request with respect to request from the processor; and 

the selected specific storage unit . , f. 

15. Acontrollermethodforcontrolof storage with respect means for reading read data, requested in the read 
to at least one storage group having one storage unit and , request, from said arbitrary one of said disk units and 
plural other storage units, comprising: transferring the read data to the processor. 

always selecting only the one storage unit for a write ^ **** ? ystem voiding to claim 19, wherein 

requestreceivedfromaprocessorfoVthe storage group «M ' P 1 ^. 0 * 

and completing the write request with respect to the T * . ^ unit group corresponding to a plurality 
selected one storage unit, and ,„ of rcquests from * e Processor and said read means 

SlSn^? 6 E"*! P^or with said 21.AWstc I agesy S temaccordingtoclaim20 wherein 

^5 If 8 ^.^mtem the selected said „ for JJ^ selects ^^taj one oS 

a^tay stoxage unit is arbitnuy with respect to the ^ units from ^ um f fi of gaid 2^°^£j 

processor read request and is an inactive storage unit, is said predetermined one disk unit ^ 

and completing the read request with respect to the a a:,i, ,* . - 

sdcctcdaibitrary storage unit co r£k * ^ * t0 ' 

16. A load distribution controller of storage with respect , 

to at least one storage group having a plurality of storage a C stora £ c; aod 

units, comprising: 20 P rc ^ ctch means for transferring prefetch data from the 

means for selecting one storage unit, arbitrarily with arbitrary one of said disk units to said cache storage . 

respect to an input/ouiput read request, with selection independently of the processor issuing a read request 

criteria being that the one storage unit is a storage unit J^S*!!™* Ctch daU * 

other than a specific storage unit having a fixed location . starage system accQrdin g to claim 22, wherein 

in the storage group and that the one storage unit is 25 rwdmeans reads read data requested in said read 

inactive; and requests from said cache storage when said read data exist 

means for thereafter completing the read request with ^^^IT^La * , • « v . 

respect to the one storage unit . A t contro1 mcthod of daun •»» wherein said reading 

17. The controller according to claim 16, including means StCP mdU ^ S a ptaHly of different read data from 
forselectingthespecfflcst^ 30 corresponding P^^y of different selected ones of the disk 
of criteria that the specific storage unit ^inactive ind Um * m rCSp ° nSC tD * ""^P^S plurality of 
selected only after all of the other storage units have been "^A^tZ^tt ft-™ u • 

found to be active, and thereafter completing the read ^control method of claim 23, whereon; 

request with respect to the specific storage unit. selecting selects independently of the processor issu- 

18. A load distribution controller of storage with respect 35 ** g ^ rea ? Tet i atsi to obt ain prefetch data, and 
to at least one storage group having a specific storage unit wherein said reading and transferring stores the 
and a plurality of other storage units comprising: prefetch data in the cache storage. 

means far selecting the , 26. A control method of claim 25, wherein said reading 

location within the one storage group forall write ? P T< g from * e cache stara * e 

requests to the one storage group, based upon selection 40 ^ . 7 "J* - ** s . tored m me unit group when 

criteria that the specific storage unit is inactive- requested read data exists in the cache storage. 

means for completing the write requests bv writing write Fl C ? Dtro1 m ? th ° d of a ^ stora S e s y stem including 

data to the^edfif storage unit? * * 3 T ^ * plmahty ° f Units ' of 

means for reporting completion of the write requests and TSSS^^i^A^ ^ ^ "H* 

f^^writing the same write data toV other 45 co^^ 

means for "ele^g an arbitrary storage unit that differs rcc ^ write data from a processor; 

from said specific storage unit for a readrequest to the Qg ^ WntC ^ int0 a master disk unit of ^ <*"k 

one storage group, with said selecting being indepen- Umt &0Up; 

dent of the read request and based upon criteria that the 50 thereafter writing the write data to all disk units of the disk 

arbitrary storage unit is inactive and a part of the unit Sr° u P o^er ^ c master disk unit; 

storage group identified by the read request from the receiving at least one read request from the processor; 

processor, and completing the read request with respect selecting at least one of the disk units in response to the 

to the arbitrary storage unit read request; and 

19. A disk storage system, comprising: 35 reading read data requested in the read request from the 

a disk unit group including a plurality of disk units, each one of the disk units other than said master disk unit 

being means for storing identical data; and transferring the read data to the processor, 

write means for writing write data into a predetermined 28 * A disk storage system comprising: 

one disk unit of said disk unit group in response to a go a disk unit group including a master disk unit and a 

write request from a processor; plurality of other disk units, each for storing identical 

after write means for writing the write data into each of data; 

said disk units other than said predetermined one disk a cache storage for storing a part of the data stored in said 

unit after said write means wrote the write data into said disk unit group* 

r*edeterrnined i one disk unit; 65 ^ means for ^ ^ ^ tfae ^ 

means for selecting an arbitrary one of said disk units umt in response to a write request issued by a oroces- 

from disk units of said disk unit group other than said sor, 
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after write means for writing the write data into all of said 
other disk units after said write means wrote the write 
data into said master disk unit; 

means for selecting at least one arbitrary disk unit from 
among the other disk units of said disk unit group other 
than said master disk unit in response to receipt of and 
independently of the content of a read request issued by 
the processor; 

prefetch means for transferring read data from the arbi- 
trary disk unit to said cache storage; and 

read means for reading the read data from said cache 
storage corresponding to a read request when the read 
data exists in said cache storage. 

29. A disk storage system according to claim 28, wherein 
said means for selecting selects a plurality of arbitrary disk 
units and said prefetch means transfers a plurality of read 
data from the selected plurality of arbitrary disk units to said 
cache storage in parallel. 

30. A control unit for processing read/write processes for 
a disk unit group, which includes a master disk unit and 
other disk units, and for operating in response to readAvrite 
requests issued by a processor to the disk unit group, said 
control unit comprising: 

write means for writing to the master disk write data, 
received from the processor in a write request issued to 
the disk unit group; 

after write means for writing the write data to all the other 
disk units after said write means wrote the write data to 
the master disk unit; and 

read means for transferring to the processor read data 
requested by the processor in a read request issued to 
the disk unit group, from an arbitrary disk unit in the 
disk unit group other than said master disk unit. 

31. The control unit of claim 30, further including 
multi read request receiving means for receiving at least 

two read requests prior to executing one of the at least 
two read requests; 
multi read disk unit select means for selecting at least two 
disk units from the disk unit group, corresponding in 
quantity to the number of the at least two read requests; 
and 

multi read means for transferring read data requested in 
the at least two read requests, from the at least two disk 
units in parallel. 

32. A control unit for processing read/write processes for 
a disk unit group which includes a master disk unit and other 
disk units, and for operating in response to read/write 
requests issued by a processor to the disk unit group, said 
control unit comprising: 

write means for writing to the master disk unit write data 

received from the processor in a write request issued to 

the disk unit group; 
after write means for writing the write data to all of the 

other disk units after said write means wrote the write 

data to the master disk unit; 
read disk select means for selecting a read disk unit from 

the other disk units other than said master disk unit, in 

response to a read request issued to the disk unit group; 

and 

read transfer means for transferring read data from the 
read disk unit to the processor. 

33. The control unit of claim 32, including 

multi read request receive means for receiving at least two 
read requests prior to executing one of the at least two 
read requests; 
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said read disk select means selecting a corresponding at 
least two of the other disk units, as read disk units to 
execute respective ones of the at least two read 
requests; and 

5 said read transfer means transferring corresponding read 
data requested in the at least two read requests, from 
each of the read disk units in parallel. 

34. A control unit far processing read/write processes for 
a disk unit group which includes a master disk unit and other 

10 disk units, and for operating in response to read/write 
requests issued by a processor to the disk unit group, said 
control unit comprising: 
a cache memory; 

write means for writing to the master disk unit write data 

15 received from the processor in a write request issued to 
the disk unit group; 
after write means for writing the write data to all of the 
other disk units after said write means has written the 

2Q write data to the master disk unit; and 

prefetch means for transferring prefetch data, which is not 
requested from the processor in a read request issued to 
the disk unit group, from an arbitrary disk unit in the 
disk unit group to said cache memory, wherein said 

25 prefetch means includes disk select means for selecting 
a prefetch disk unit from the other disk units before the 
read request is issued to the disk unit group; and means 
for transferring the prefetch data identified in the read 
request from the prefetch disk unit to said cache 

30 memory before the read request is issued. 

35. The control unit of claim 34, wherein said disk select 
means selects at least two prefetch disk units corresponding 
to at least two read requests; and 

said means for transferring transfers prefetch data from 
35 corresponding ones of said prefetch disk units to said 
cache memory in parallel. 

36. A control method of a control unit processing read/ 
write processes for a disk unit group which includes a master 
disk unit and other disk units, and which control unit is 

40 operable in response to read/write requests issued by a 
processor to the disk unit group, said control method com- 
prising the steps of: 

writing to the master disk write data received from the 
processor in a write request issued to the disk unit 
45 group; 

writing the write data to all the other disk units after said 

first mentioned writing; and 
transferring to the processor read data, requested from the 
processor in a read request issued to the disk unit group, 
50 from an arbitrary disk unit in the disk unit group other 
than said master disk unit 

37. The control method of claim 36, further including the 
steps of: 

55 receiving at least two read requests from the processor; 
selecting at least two of the disk units, corresponding in 

number to the number of read requests; and 
wherein said transferring transfers read data requested in 
the at least two read requests from the at least two disk 
50 units in parallel. 

38. A control method according to claim 36, wherein said 
transferring includes selecting a read disk unit from the other 
disk units to execute a read request issued to the disk unit 
group. 

65 39. The control method of claim 38, further including the 
steps of: 

receiving at least two read requests; 
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selecting at least two of the other disk units, to execute 
respective ones of the at least two read requests; and 

transferring read data requested in the at least two read 
requests from each of the at least two other disk units 
in parallel 5 

40. A control method of a control unit including a cache 
and processing readAvrite processes for a disk unit group 
which includes a master disk unit and other disk units and 
which control unit is operable under control of readAvrite 
requests issued by a processor to the disk unit group, said » 
control method comprising the steps of: 

writing to the master disk write data received from the 
processor in a write request issued to the disk unit 
group; 

writing the write data to all the other disk units after said 
first mentioned writing; 

transferring data, not requested from a processor in a read 
request issued to the disk unit group, from an arbitrary 
disk unit in the disk unit group to the cache. 20 

41. The control method of claim 40, further including the 
steps of: 

multi prefetch means for transferring prefetch data, which 
is not requested from the processor in a read request 
Issued to me disk unit group, from at least two arbitrary 25 
ones of the disk units to said cache in parallel 

42. A control method of a control unit including a cache 
and processing read/write processes for a disk unit group 
which includes a cache, master disk unit and other disk units 
and which control unit is operable under control of read/ 30 
write requests issued by a processor to the disk unit group, 
said control method comprising the steps of: 

writing to the master disk write data received from the 
processor in a write request issued to the disk unit 
.group; 35 

writing the write data to all of the other disk units after 

said first mentioned writing; 
selecting a prefetch disk unit from all the other disk units 

other than said master disk unit; and 
transferring prefetch data from the prefetch disk unit to 

the cache when the prefetch data is identified in a read 

request that has not yet issued from a processor. 

43. The control method of claim 42, further including the 
steps of: 

i . 45 
selecting at least two disk units from the other disk units, 
as prefetch disk units to transfer prefetch data which is 
not requested in read requests issued to the disk unit 
group; and 

transferring prefetch data from each of prefetch disk units so 
to the cache in parallel. 

44. A control unit for processing input/output processes 
for a disk unit group of disk units, the disk unit group being 
operable under the control of a processor issuing read/write 
requests to the control unit for the disk unit group, said 55 
control unit comprising: 
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write means for processing a write request from the 
processor to the disk unit group for writing write data 
of any write request from the processor to only one disk 
unit of the disk unit group and thereafter reporting write 
completion to the processor; and 
after write means for thereafter writing the same write 
data to all the other disk units of the disk unit group. 
45. A disk storage system, comprising: 
a disk unit group including a plurality of disk units, each 

being means for storing identical data; 
means for selecting an arbitrary one of said disk units 
from said disk unit group in response to a write request 
from a processor based upon criteria that the one disc 
unit is inactive; 
write means for writing write data of the write request into 
only the one disk unit of said disk unit group in 
response to the write request from the processor and 
thereafter reporting write completion to the processor; 
and * 

after write means for writing the write data into each of 
said disk units other than the one disk unit after said 
write means wrote the write data into the one disk unit 
and after said write means reported write completion. 

46. A controller method for control of storage with respect 
to at least one storage group having plural storage units 
comprising: ' 

selecting only one of the storage units for a write request 

received from a processor for the storage group; 
said selecting being based upon criteria that the selected 
storage unit is arbitrary with respect to the processor 
write request and is an inactive storage unit; 
thereafter completing the write request by writing to only 

the selected storage unit; 
thereafter reporting completion of the write request to the 

processor; and 
thereafter afterwriting to others of the storage units. 

47. A storage control unit for processing input/output 
processes for a storage unit group including a plurality of 
storage units, the storage control unit having a plurality of 
transfer control means for executing data transfer operations 
between the storage units and a plurality of processors under 
control of the processors issuing read/write requests to the 
control unit for the storage unit group, each of said transfer 
control means comprising: 

means for selecting, from the storage unit group, one of 
the storage units which is not perfenmng either a read 
or write operation, in response to a read request from 
one of the processors; and 
read means for reading read data, requested in the read 
request, from said one of the storage units and trans- 
ferring the read data to said one of the processors, 
wherein the reading and transferring of said read means 
are conducted in parallel for the plurality of processors. 
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